Suppress non-speech-related token outputs #473
I was suggesting this too. Good idea. However, shouldn't this be controlled by a bool in whisper_full_params?
You're right. I will add a parameter to control suppression.
This seems to be causing hallucinations at the end of the audio, e.g. "Thank you for listening", "Amen", ...
* add non-speech-token suppression
* add suppress non-speech_tokens param
…#473) This seems to be causing hallucinations in the end of the audio, e.g.: "Thank you for listening" "Amen" ..
Problem
Change
(https://github.com/openai/whisper/blob/7858aa9c08d98f75575035ecd6481f462d66ca27/whisper/tokenizer.py#L224-L253)